Acoustic and Lexical Modeling Techniques for Accented Speech Recognition

نویسندگان

  • Udhyakumar Nallasamy
  • Alan W. Black
  • Monika Woszczyna
چکیده

Speech interfaces are becoming pervasive among the common public with the prevalence of smart phones and cloud-based computing. This pushes Automatic Speech Recognition (ASR) systems to handle wide range of environments including different channels, noise conditions and speakers with varying accents. This thesis focuses on the impact of speakers’ accents on the ASR models and techniques to make them robust to such variations. State-of-the-art large vocabulary ASRs perform poorly when presented with accented speech, that is either unseen or under-represented in the training data. Current approaches to handle accent variations mainly involve adaptation of acoustic models or the pronunciation dictionary. This thesis examines novel adaptation algorithms capable of modeling changes in phonological realizations, that uniquely characterize accent variations. Techniques that can exploit the contemporary availability of extensive, albeit unlabeled data resources are also investigated. We design experiments under various scenarios where accent adaptation is critical for speech recognition. In target accent adaptation setup, a source ASR trained on resourcerich accent(s) is adapted to a target accent with limited adaptation data. We propose semi-continuous decision tree adaptation and multi-gram pronunciation models to efficiently model the pronunciation changes between source and target accents. Active and semi-supervised learning are studied to extend the improvements obtained from supervised adaptation. We introduce relevance criteria based data selection to sample additional accent-specific data from large, unlabeled speech corpora with multiple accents. Finally, we generalize the target accent adaptation techniques to handle multiple accents in the training set. We formulate an accent adaptive training framework using factorized models with shared canonical parameters and accent-specific modules. Our proposed algorithms will be evaluated on Arabic and English accents and compared against existing adaptation techniques.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Accent detection and speech recognition for Shanghai-accented Mandarin

As speech recognition systems are used in ever more applications, it is crucial for the systems to be able to deal with accented speakers. Various techniques, such as acoustic model adaptation and pronunciation adaptation, have been reported to improve the recognition of non-native or accented speech. In this paper, we propose a new approach that combines accent detection, accent discriminative...

متن کامل

Acoustic and phonetic confusions in accented speech recognition

Accented speech recognition is more challenging than standard speech recognition due to the effects of phonetic and acoustic confusions. Phonetic confusion in accented speech occurs when an expected phone is pronounced as a different one, which leads to erroneous recognition. Acoustic confusion occurs when the pronounced phone is found to lie acoustically between two baseform models and can be ...

متن کامل

Acoustic and lexical resource constrained ASR using language-independent acoustic model and language-dependent probabilistic lexical model

One of the key challenges involved in building statistical automatic speech recognition (ASR) systems is modeling the relationship between subword units or “lexical units” and acoustic feature observations. To model this relationship two types of resources are needed, namely, acoustic resources i.e., speech data with word level transcriptions and lexical resources where each word is transcribed...

متن کامل

Partial Change Accent Models Speech Recog

Regional accents in Mandarin speech result mostly from partial phone changes due to the interlanguage system of non-native speakers. We propose partial change accent models based on accent-specific units with acoustic model reconstruction for accented Mandarin speech recognition. We use phonological rules of dialectical pronunciations together with likelihood ratio test to model actual accented...

متن کامل

Approaches to foreign-accented speaker-independent speech recognition

Current research in the area of foreign-accented speech recognition focusses either on acoustic model adaptation or speakerdependent pronunciation variation modeling. In this paper both approaches are applied in parallel and in a speaker-independent fashion: the acoustic modeling part is based on a derived Hidden Markov Model (HMM) clustering algorithm and the lexicon adaptation is based on spe...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2012